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Abstract. Relying on the co-area formula, an exact relaxation framework for minimizing objec- 
tives involving the total variation of a binary valued function (of bounded variation) is presented. The 
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It is shown that thresholding a solution of the relaxed problem almost surely yields a solution of 
the original binary-valued problem. Furthermore, stability of solutions under data perturbations is 
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is considered. For the efficient numerical solution of the relaxed problem, a locally superlinearly 
convergent algorithm is proposed which is based on an exact penalization technique, Fenchel duality, 
and a semismooth Newton approach. The paper ends by a report on numerical results for several 
applications in particular in mathematical image processing. 
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1. Introduction. We consider the binary constrained minimization problem 



min H(u,v):^F(v)+ G(v)udx + pj(u), (1.1) 

where V is a reflexive real Banach space, F : V — > M is a (sequentially) weakly lower 
semicontinuous functional, which is bounded from below, G : V — > is a strongly 

continuous nonlinear operator for some p > 1, C R'', d G N, is a bounded and 
piecewise smooth open set. By BV{V,; S) we denote the space of functions of bounded 
variation, where {S; 6) is a locally compact metric space with some set S and a metric 
5 on S. According to [3], a Borel function u : 51 ^> S* is an element of BV{ft; S) if there 
exists a finite measure m such that m{B) > Jg \Dip{u)\ = \D(p{u)\{B) for all Borel 
sets B C f2 and for any Lipschitz function : S* — > R with a Lipschitz constant not 
larger than 1, i.e., ip e Lip]^(S'). Here, \Dip{u)\{-) represents the usual total variation 
measure of a function (p{u) e BV{fl); see, e.g., [19 . In fact, it is well-known that 
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w G BV{VL) iff w G L^{VL) and the total variation measure (SlZ-seminorm) 



/ \Dw 



— sup 



{ 




w divvdx :ve {c^{n)f, Halloo < 1 



} 



is finite. In tlie definition of \Dw\, {C(^{ft))'^ is the space of R'^-valued functions 
with compact support in D,. The space BV(fl) endowed with the norm ||w||_By(Ji) = 
+ Jo l-^^l ^ Banach space; see, e.g., [1^. In (jl.ip . we have {S;S) := 
({0, 1}; I ■ I) and the total variation (TV) of m G BV^il; S) is given by 



which is coupled to the remaining terms of the objective by /3 > 0. Here, A4 — sup 
defines the supremum over ct- additive measures over the Borel sets in J7; see 

Problems of the type considered in (jl.ip are of fundamental importance in many 
application areas. Indeed, in imaging science, both the TV-based binary image de- 
noising model [SI [2H] and the piecewise constant Mumford-Shah segmentation model 
[TTl [25] are respectively equivalent to (jl.ip with F = and G(-) = 5 G LP{ri) for 
p > where the binary variable u represents the binary image, i.e. the support set of 
image features, which is recovered from noisy data contained in g. In the optimal de- 
sign of composite membranes |12[ I13j , the minimization problem of the first eigenvalue 
for an underlying elliptic problem fits into the structure considered in where 
V is the corresponding eigenfunction and the binary-valued u represents the optimal 
configuration. Furthermore, (jl.ip also appears in the optimization of the topology of 
structures, e.g., in the well-known minimal compliance problem; see, e.g., [2j[5]. In 
the latter case, v denotes the elastic displacement and u the material density. 

Clearly, the binary constraint on w, contained implicitly in the definition of the 
underlying function space, poses significant analytical as well as numerical challenges. 
In fact, the binary constraint renders (jl.ll) non-convex and combinatorial in nature. 
Combined with the non-differentiability of H in the first variable (which is due to the 
presence of J) , the latter property challenges the development of efficient numerical 
solvers. Due to these mathematical difficulties and the wide applicability, recently 
problems with binary constraints have become the focus of research efforts in different 
model frameworks; see, e.g., [6l [9l [TOl [T2] . 

In this paper, starting from the general model (jl.ip we propose a relaxation 
framework for ()l.ip by replacing u{x) G {0,1} by the box constraint u{x) G [0,1]. 
Interestingly, it turns out that this relaxation is exact, in the sense that a minimizer 
of (jl.ip can be obtained almost surely from thresholding a solution to the relaxed 
problem with u{x) G [0, 1] for almost all x G We also provide a stability result with 
respect to perturbations of the operators G, and we incorporate volume constraints, 
which are important, e.g., in topology optimization. Numerically, the box constraint 
on u is realized by an exact penalization, and considering the nondifferentiable TV- 
term in the objective, a Fenchel duality technique is utilized to solve the relaxed 
problem on u. Specifically, the pre-dual to the relaxation is a quadratic optimization 
problem with box constraints, which is solved by an inexact semismooth Newton 
method. The resulting algorithm converges locally at a superlinear rate, and the 
efficiency of the proposed solver is demonstrated by means of several applications of 



The outline of this paper is as follows. In Section 2, we study the exact relax- 
ation of (jl.ip and stability results. Moreover, the situation with an additional volume 




n 



Du\ := M - snj> {\Dip{u)\ : ip G Lipi(S')} , 



dnD. 
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constraint is considered. Section 3 describes our algorithmic framework. Several ap- 
plications including numerical results are shown in Section 4, and, finally, conclusions 
are drawn in Section 5. 

2. Exact relaxation in BV-space. We start our analysis of by focusing 
on the binary constrained variable u and by considering an auxiliary problem. After 
its analysis we return to the full model considered in (jl.ip . 

2.1. An auxiliary problem. The auxiliary problem, which simplifies (jl.ip . 
consists of a linear term and the TV regularization only, i.e., we have 



where we assume that g G LP{n) for some p > 1. Obviously, (j2.ip is non-convex due 
to the binary constraint on u. Convexity is obtained by considering the relaxation 



The existence of a minimizer to ()2.ip and (|2.2[) . respectively, is addressed next. 

Proposition 2.1. Problem (12. ip and problem (|2.2p admit minimizers, respec- 
tively. 

Proof. For any u £ BV{Vt] S) with either S = {0, 1} or S = [0, 1] we have 



Thus, on sublevel sets of the objective functionals in p.ip and (|2.2p . respectively, the 
total variation is uniformly bounded. Since the elements are additionally bounded 
in L°°{n) ^ L^'ip.) for all q < oo, we obtain the weak-* compactness of level sets 
in BV{Vl.\ S) n L'^{VL). Based on the weak lower semicontinuity of J according to [3l 
Thm. 2.4] (see also [HI Thm. 1.9] for BV{VL)), choosing q = ^ we also obtain 
the weak lower semicontinuity of the objective functional and thus the existence of a 
minimizer in either case. □ 

The main result of this section. Theorem 12.21 below, states that the relaxation 
(|2.2p of (|2.1I) is exact, in the sense that minimizers of the original problem (|2.ip are 
obtained from minimizers of its relaxation. Theorem 12.21 also provides a constructive 
way to compute (almost surely) minimizers of ()2.ip by taking level sets of minimizers 
of (|2.2p . This result extends a corresponding observation in [9], where total variation 
based image denoising with an data fidelity and given image data corresponding 
to a characteristic function of a bounded domain is considered. For its proof we 
recall that the perimeter Per(i?) of a measurable set E (Z 0, relative to satisfies 
Per(i?) = \Dxe\, where xe denotes the characteristic function of E relative to Jl. 

Theorem 2.2. Let u G BV[Vt\ [0, 1]) be a minimizer of \2.2\ . Then, for almost 
every t e (0, 1), the function u* G BV{^1] {0, 1}) defined as the indicator function of 
the level set {u > t}, i.e., 




(2.1) 




(2.2) 





is a minimizer of (|2.ip and (|2.2I) . 
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Proof. Let u G BV{n; [0,1]) be a minimizer of (g^]) and v e SF(fi;{0,l}) be 
a minimizer of (|2.ip . Then, due to the coarea formula [18, Thm. I] and Fubini's 
theorem we have 

g udx + l3 Jiu) ^ g{x) Atdx + P i Vcy{{u > t}) dt 

Jn Jq Jo 



g{x) u\x) dtdx + l3 / J{u') dt 
Jo 

(/ 5 + •^("*)) 

> [ (^J g V dx + fi J(u)^ dt 
g V dx + /3 J{v). 



In 

Here we have used the fact that u* G BV{il; {0, 1}) to obtain the above inequahty. 
One immediately observes from the above estimate that 



g u dx + (3 J{u) > / g V dx + (3 J{v) 



if M* is not a minimizer of p.ip . which contradicts the minimizing property of u. 
Hence, we conclude the assertion. □ 

Based on Theorem 12. 2[ algorithmically we may find a minimizer of the binary 
constrained non-convex problem (|2.ip by solving its convex relaxation (j2.2p followed 
by a thresholding step. Further note that since the objective of (|2.2p is convex but 
not strictly convex, we cannot expect uniqueness of the solution to the minimization 
problem. However, in situations where the solution is indeed unique, we conclude 
from Theorem 12.21 that the minimizer of (12.11) is unique and is the same as for (|2.2p . 

Corollary 2.3. // the minimizer u of (|2.2p is unique, then u e BV{Q; {0, 1}) 
and u is the unique minimizer of (12. ip . 

2.2. The full problem. Next we return to the full problem (jl.ip . The associ- 
ated relaxed version reads as follows: 



min H{u,v):=Fiv)+ G{v)udx + l3Jiu). (2.3) 

(n,i,)esy(O;[0,i])xv Jn 

For the existence of a minimizer to (|l.ip and (|2.3I) . respectively, we first discuss the 
sequential weak lower semicontinuity of the objective functional. 

Lemma 2.4. Suppose that {v^} converges weakly in V to v V , and {u''} weak-* 
converges m BV{n; [0, 1]) n L°°(fi) to u € BV{n; [0, 1]) n i°°(rj) as k ^ +oo. Then, 
it holds that 

F{v) + I G{v)u dx + l3 J{u) < liminf ( F{v'') + [ G{v'')u^ dx + 13 J{u^) 

Proof. Due to the strong continuity of G, we obtain that G{v'') strongly converges 
to G{v) in L^{fl). Because of the weak-* convergence of u'^ in L°°{n), we have 

/ G{v'')u'^ dx ^ G{v)u dx as k +oo. 
Jn Jn 
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Since F is sequentially weakly lower semicontinuous on V and J is lower semicontin- 
uous in the strong topology of L^(ri), we obtain 



liminf F / G{v'')u'' dx + P J{u'') 

> liminf + lim [ G{v'')u'' dx + ^ \iuiM J{u'') 

> F{v) + I G{v)u dx + P J{u), 

Jn 

which completes the proof. □ 

Another important property in the analysis of the minimization problem is the 
compactness of level sets associated with the objective function. 

Lemma 2.5. Assume that the set 

{Fiv)~e\\Giv)\\% \veV} 

is precompact in the weak topology of V for some e > or for e = with the additional 
condition that G(-) > 170 for some go € R. Then the sub-level sets of the functional H 
are precompact in the weak topology of V with respect to v and in the weak- * topologies 
of BV{VL) and L°°{fl) with respect to u, i/O < m < 1 almost everywhere (a.e.) in f2. 
Proof. We first study the case e > 0. Using Holder's inequality, the fact that 



< / udx<\Vt\ 
Jn 



which is due to < u < 1 a.e. in il, and Young's inequality, we estimate 

H{u,v) > F{v) ~ \\G{v)h4u\\L. + P J{u) > F{v) - e\\G{v)\\h - ^ +/? J(u), 

where denotes the volume of the set ft and p^^ + q^^ = 1. Since J{u) is non- 
negative, we conclude that boundedness of H(u, v) implies boundedness of F{v) — 
4Giv)\\h. 

In the case where G(v) > go we estimate 

H(u,v)>F{v)+ I min{0,.go} dx + J{u) > F{v) - \go\\n\ + P J[u), 
Jn 

and thus, boundedness of -ff (u, v) implies boundedness of Fiv). Under the above con- 
ditions, we obtain in both cases precompactness with respect to v in the weak topology 
of V on sublevel sets of H . Due to the strong continuity of G, |jG(f;)||ii is bounded by 
some constant ci > on a weakly compact set in V, and thus /^^ G(v)u dx > —ci\fl\. 
Together with the fact that F is bounded from below, we obtain that J is uni- 
formly bounded on sublevel sets of H. Moreover, we have < 1, which implies 
< and hence, ||u||By is uniformly bounded. As a direct consequence we 
obtain the weak-* precompactness with respect to u. □ 

Now we have verified the assumptions of the fundamental theorem of optimization 
for the functional H, and as a consequence we establish an existence result. 

Corollary 2.6. Under the conditions of Lemma \2.5\ there exists a minimizer of 
the functional H on BV{^; [0, 1]) x V as well as on BV{^1; {0, 1}) x V. 
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The proof of Corollary 12.61 utilizes the usual infimal sequence argument relying 
on weak-* respectively weak convergence of a subsequence of {u''} and {v''} and the 
weak lower semicontinuity result of Lemma l2.5l as well as [3! Thm. 2.4] and fW, Thm. 
1-9]. 

We finally derive the main result of this section, a relaxation property similarly 
to the one obtained in Theorem 12.21 

Theorem 2.7. Let the assumptions of Lemma \ 2.5\ he fulfilled and let {u,v) G 
BV{VL] [0, 1]) X V 6e a minimizer of (j2.3l) . Then, for almost every t g (0, 1), the pair 
{u* ,v) £ BV(fl; {0, 1}) X V, where it* is the indicator function of the level set {u > t}, 
i.e.. 



1, if u > t, 
0, else, 

is a minimizer of p.ip and (12.31) . 

Proof. Since {u,v) G BV{ft; [0, 1]) x V is a minimizer of (12.31) . then in particular 
M is a minimizer of 

min H(u,v), 

ueBVifl-.lO,!]) 

which is equivalent to the auxiliary problem (j2.2p with g := G{v). Hence, from Theo- 
rem [52] we conclude that for almost every t the indicator function u* is a minimizer, 
too. From this, we infer 

H(u*,v) = H(u,v) ^ inf H(u,v) < inf H(u,v), 

{u,v)eBV{n:[0A])xv {u,v)eBV {n:{o,i})xv 

and, thus, {u*',v) is a minimizer of and (|2.3p . □ 

2.3. Stability. In the following we investigate the stability of (|l.ip with respect 
to perturbations of the operators. Since the stability with respect to v can be obtained 
under standard conditions in Banach spaces |29j , we restrict our attention to stability 
of the auxiliary problem (|2.ip and its relaxation (|2.2p with respect to perturbations 
of g. For this purpose, we state an interesting preliminary result concerning the 
equivalence of (|2.ip to the optimization of an objective functional consisting of a 
quadratic term and the total variation functional. 

Lemma 2.8. For g € L^{fl), (|2.ip is equivalent to the regularized least-squares 
problem 

If I 

min - {u + g - -fdx + P J{u). (2.4) 
«ew(n;{o,i}) 2 ^2^ 

Proof. Let a; G fi. For u{x) G {0, 1} we have u{xY — u{x) and thus 

uixf^2uix)[gix)-'^^uix)gix). 

Since this equality holds almost everywhere in Q, we deduce 

J^g udx^ ^ Jj.^ + a - ^fdx - \ j^ia- 

Hence, the objective functionals in (|2.ip and (|2.4p only differ by a constant term and 
consequently their minimizers are equal. □ 
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Using the results of [1_ for regularized least-squares problems of the form (j2.4p 
in BV{fl) together with the closedness of BV{^1; {0, 1}) with respect to strong L^- 
convergence (cf. e.g. |15l), we can immediately deduce a stability result for the 
geometric problem. 

Proposition 2.9. Let ^ g strongly in L^{Q) and let u'' E BV{il; {0, 1}) de- 
note a minimizer of (j2.ip with g replaced hy g^ . Then, {u^)k has a strongly converging 
subsequence in LP{Q) for each p g [l,oo), and each limit of a converging subsequence 
is a minimizer of (|2.ip . 

As a direct consequence of Proposition l2.9l and Theorem l2.21 we obtain the follow- 
ing stability result for the level sets of minimizers of the perturbed relaxed problem. 

Corollary 2.10. Let g^ g strongly in L'^{n) and let u'^ e BV 
denote a minimizer of (j2.2p with g replaced by g^ . Then, for almost every t G (0, 1), 
the indicator function of {u*' > t} has a strongly converging subsequence in LP{il) for 
each p G [1, cxd), and each limit of a converging subsequence is a minimizer of (|2.ip . 

2.4. Quadratic penalization uniqueness. Since the objective functionals 
in (|l.ip and (|2.ip are not strictly convex with respect to u, as stated before we can 
expect uniqueness of solutions neither for these problems nor for their relaxations. 

In order to obtain an approximate problem with a unique solution, it seems 
natural to add a quadratic regularization term with a small parameter e > 0. For this 
reason, we consider 



with a given u £ BV{rt; [0, 1]). We have the following existence and uniqueness result. 

Proposition 2.11. For e > the optimization problem (|2.5p has a unique 
solution u" G BF(rj; [0,1]). 

Proof. First of all, existence of a minimizer can be concluded by similar arguments 
as for (12. 2p . Since the first two terms of the objective functional are convex, and the 
additional quadratic term is strictly convex, we conclude uniqueness of the minimizer. 
□ 

Due to the additional quadratic term, the relaxation property of Theorem 12.21 
does not hold for (|2.5p . Consequently, the minimum in BV{ft; {0, 1}) might be strictly 
greater than the one in BV{fl; [0, 1]). However, we are only interested in (|2.5p as an 
approximation to problem ()2.1|) and therefore study the convergence of level sets as 



Theorem 2.12. For every sequence — there exists a subsequence (again 
denoted by {ek)k) such that for almost every t G (0, 1), the sequence {u'^'"^)k, with u'^'"* 
defined as the indicator function of the level set {u'^'' > t}, converges to a solution 
of problem (12. ip in the strong topology of L^{VL) for each p G [1,cxd). Moreover, if 
for any sequence — J> 0, the sequence it'^'''* converges, then the limit is a solution of 
problem ([2T2|) . 

Proof. Let u° be a minimizer of (|2.2p . Based on the definition of u'^ and u° , we 
conclude 




(2.5) 



e ^ 0. 



/ 



g u" dx + 13 Jiu") + - 




[ {u' - iifdx < j gu° dx + p J{u°) + i f {u°- u)^dx 
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and thus, 



{u' - uydx < {u°- uYdx, 

n Jn 

which imphes in particular that Iju*^ — u\\l2 is uniformly bounded in e. Let g'^ :- 
g + e(u'^ — u), then is also a minimizer of 

g*^ u dx + P J{u) subject to (s.t) u{x) £ [0, 1], 



mm 

ueBV(n)jQ 

which can be seen from the associated necessary and sufficient optimality condition. 
Since 

\\g'-g\\L^^e\\u^-u\\L2=Oie), 

we can employ Corollary 12. 101 to conclude the assertion. □ 

2.5. Volume constraints. In many applications, volume constraints on u are 
relevant. A particular instance occurs in topology optimization, where such a con- 
straint fixes the amount of material available for designing an optimal topological 
structure; see section [4.3.21 below for an application in minimal compliance topology 
optimization. 

In view of this discussion, the model (jl.ip is augmented by adding the volume 
constraint 

u dx = V, (2.6) 

n 

with fixed 0<V <\n\. 

We observe that the range space of this constraint is finite (actually one-) dimen- 
sional. As the following example highlights, constraints in function space could be 
problematic. 

Example 2.13. Let V = L^i^l), F{v) = J^\v ~ dx, and G{v) = 0. We 
consider the relaxed problem (j2.3p subject to the constraint v = u in L^(f2). Obviously, 
the objective functional vanishes for v ~ u = i, and it is positive for any other 
{u,v) £ BV{^;[0,1]) X L^(r2). Thus, v = u = ^ is the unique minimizer, and in 
particular, the infimum over BV{n] {0, 1}) is strictly larger than 0. 

Possibly the main reason for the fact that the "exact" relaxation result does not 
hold for general constraints is a potential nonexistence of Lagrange multipliers for the 
binary- valued problem. In the case of the volume constraint (j2.6p . however, we can 
verify the existence of a (scalar) Lagrange multiplier and use it to prove an equivalence 
result for the relaxation in this case. We start again with the minimization of the 
auxiliary problem from above, but now subject to (j2.6|) . 

Theorem 2.14. Let u £ BV{fl; [0,1]) be a minimizer of ([2^ subject to ([T6l) . 
Then, for almost every t £ (0,1), the function u* £ BV{VI;{{),1}) defined as the 
indicator function of the level set {u > t}, i.e.. 



u\x) 



1, if u > t, 
0, else. 



is a minimizer of (|2.ip and (12.21) subject to (j2.6|) . respectively. 
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Proof. In the following let (for A G M) denote a minimizer of the problem 

min (g + X) u dx + (3 J{u). (2.7) 

Existence of follows from Proposition 12.11 Moreover, we can apply the above re- 
laxation results and therefore we may assume that G BV{n,; {0, 1}). By comparing 
objective functionals, it is easy to show that we have for e > 



u 



dx < dx. 



Further, using the weak-* compactness in L°°{^) n BV{ft; {0, 1}) we can prove for 
fik ^ X that a subsequence {u^^'')k converges to as k ^ +oo in the weak-* topology 
of L°°{n) and, thus, in particular it holds that 

u'^'' da; -> / u'^ dx. 
1 Jn 

Hence, we conclude by standard arguments for real functions that the map 

: R ^ M, dx 



is well-defined, continuous and non-increasing. As A — ?> oo we must have m'*' da; ^> 
due to uniform boundedness of the objective functional, and for A — >■ — oo we obtain 
JqU^ dx — > |r2|. Since is a continuous function on R, we conclude that for each V G 
(0, \n\) there exists A £ R such that W{X) = V. Moreover, for aU u G BV{n; [0, 1]) 
with J^u dx = V we have from the definition of 



gu"" dx + (3 J{u^) = {g + A) da; -I- ^ J(u^) - XV 
Jn 

< I {g + X)udx + l3 J{u) - XV 
g u dx + j3 J{u). 



Thus, is a solution of the relaxed problem (12. 2p subject to 

Now take any solution u G BV{Vl\ [0, 1]) of (|2.2p subject to (|2.6p . Then again it 
holds that 



[g + X) dx + P J{u-^) = / {g + X)udx + l3 J{u) 



and hence, u is also a solution of p.7p . Now we can apply Theorem [221 to this problem 
and show that the indicator functions u* are solutions, too, for almost every t. Then, 
by the above argument, -u* is also a solution of (|2.2p subject to (|2.6p . □ 

By reasoning analogous to the one in the proof of Theorem 12.71 one shows the 
following result. 

Theorem 2.15. Let the assumptions of Theorem \2.7\ be fulfilled and let (u, tJ) G 
-BV^(r2; [0, 1]) X V be a minimizer of (|2.3p subject to (j2.6p . Then, for almost every 
t G (0, 1), the pair {u^ ,v) G BV{Cl; {0, 1}) x V, where w* is the indicator function of 
the level set {u > t}, i.e., 



u\x) 



1, ifu>t, 
0, else. 



is a minimizer of (|l.ip and (12.31) subject to (j2.6p . respectively. 
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3. The Fenchel dual problem and dual semismooth Newton. This section 
is devoted to the study of an efficient algorithm for solving the convex relaxed auxihary 
problem (|2.2I) with g £ LP{n) for some p > 1. 

In order to remove the box constraint in (|2.2p . we add a penalty term to obtain 
an unconstrained minimization problem of the form 

' ' ij dx + Z? J{u) (3.1) 

with a penalty parameter c > 0. Note that the max-term penalizes violations of 
u £ [0, 1] in (j2.2p and yields an exact penalty [7l[23], i.e., there exists c* > such that 
the solution to (|2.2|) solves ()3.1|) . as well. 

In order to have uniqueness of a solution, which is not guaranteed for p.l|) . we 
consider next 



/ g u dx + c max I 0, 2 


1 




"~2 


-0 


Jn JQ \ 







/ g u dx + c max I 0, 2 


1 




"~2 


-0 


'n Jn \ 







+ I / \u-l\'dx + - I \u\'' dx + f3 J(u), (P) 



with arbitrary, but fixed e > 0. Note that the additional quadratic terms (depending 
on the magnitude of e) prefer solutions with u{x) e {0, 1}. 

In view of the non-differentiability of the TV regularization and the penalty term, 
respectively, we now study a dual formulation. For this purpose we formally compute 
the Fenchel dual of and set A := V as well as 

J-{u) :— j g u dx + c j max ( 0, 2 u — - — 1) d2;+- / |u— Ipdx-I-- / jitp da;, 

(3.2) 

and further 



g{p) [ IpIi dx, 

Jn 



where | • 1 1 denotes the Zi-vector norm. Based on the definition of the Fenchel conjugate 
[TT] . we get 

T*{u*) =^[||min(u* - .g + e + 2c,0)||2 + ||max(u* -5 + e,0)||2 

+ II max(7.* _ .g - e - 2c, 0)||2 - || max(u* - g - e, 0)f ] - ||f]|, 

where 

f^.fO, if - 131 <p'{x) < 131 ioT a..e. X €n, 
H-ptpi]yP ^ " \ oo, otherwise. 

Thus, formally the dual of (0 is given by 

min — [11 min(divg - 5 + e + 2c, 0) |P + || max(divg - .g + e, 0)|P 

|g(a:)|oo<0 a.c. in f2 4e 

+ II max(divg-- 5 - e - 2c, 0)||2 - || max(divg-'- 5 - e, 0)f] - ||f}|, (P*) 
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where q — —p*. 

Based on first-order optimality conditions, solutions u and q of ([P| and ([P 
respectively, satisfy 

^(div(f — g + e + 2c), if divg — g < —e — 2c, 

0, if divg — g G [— e — 2c, — e], 
^{divq-g + e), if |div(f- < e, (3.3) 

1, _ if div^-.g e [e, e + 2c], 
^(divg — 5 + e — 2c), if divg — g > e + 2c, 

l^/sfr^l on {a; : u^.(a;) ^ for alH}. (3.4) 

We observe that if e is sufficiently small and c is sufficiently large such that |div(f— g| is 
always in [e, e + 2c], then the expected solution u S {0, 1} is found directly. However, 
for X e r2 with u{x) e {0, 1}, we only get the characterization e < |div<f(x) — g{x)\ < 
e + 2c, but we do not obtain exact values for div(f(a;) — g{x). In order to cope with 
this incomplete characterization of divg — g, we propose to add a regularization term 
to the objective of (IP*[) yielding 

min — ||div(f — g\\\ H [|| min(div(f — g + e + 2c, 0)||2 

[(jCa:) loo </3 a.e. in £1 

+ II max(divg - 5 + e, 0)||2 + || max(divg - 5 - e - 2c, 0)||2 
-||max(divg-5-e,0)||2] -l|r!|, {P;) 

where 7 > is the dual regularization parameter. To understand the structural 
impact of the 7-regularization, we study the dual of ( P* ) and relate the resulting 
problem to ((P|) . For this purpose we introduce the function 

j;(u*) ^l\\u* - g\\l + 1 [II min(u* - .g + e + 2c, 0) \\l + \\ max(«* - 5 + e, 0)11^ 
+ II max(7.* - 5 - e - 2c, 0)||2 - || max(u* - 5 - e, 0)||2] - 1 |f]|. 
Based on the definition of the Fenchel conjugate we obtain the dual of F* as 



:F;*{un ^J^u**gdx- ^^^^ I ^^^^ [II min(^** + (e + 2c)7 



+ II max(w** + 67, 0) ||^ + || max(M** - 1 - (e + 2c)7, 0) ||^ 

- II max(^** - 1 - 67,0)11^] + ^h**f + ||f^|. (3.5) 

Comparing J-** with J- in the original problem ((P)) . we come to the following conclu- 
sion. 

Proposition 3.1. As 7 tends to zero, J-** defined in ()3.5p converges to T defined 
m (lO). 



Proof. For making the relation between J-** and J- more explicit, we define 

Qi ^{x e n : u{x) < -(e + 2c)7}, ={x G ft : u{x) G (-(e + 2c)7, -£7)}, 

^3 ^{x e ft ■■ u{x) e [-67, 1 + £7]}, ^4 ={x e ft -. u{x) e (1 + £7, 1 + (e + 2c)7)}, 
fl5 ={x e : u(x) > 1 + (£ + 2c)7}. 
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Obviously, we have fl = [Jl^i ^i, and Hi f^Hj =0 for any i ^ j with i,j = 1, • ■ • ,5. 
Then, J-** can be equivalently written as 

J-**(u) — f u q dx -\ — / niax(0, 2\u — -I — 1 — 2ej) dx 

^ ' ' Jn 1 + 267 70 2' 



1 



1 + 267 



n 

2 1 4^2 



I \u-l\^ dx + ^ I \u\^ dx 



2(1 + 267) ' 



1 + 267 J.. ^27 - + + + - - ilTT^I"^"- 

Based on the definitions of ^2 and $14, we have 

6^7 1 (e + 2c)^7 
— < 2^^" < 2 ' 

— <^(w-i) < ^ , Vxen^. 

Hence, when 7 goes to zero, these two terms in u are uniformly bounded. Furthermore, 
the integrals on and 1^4 in the above definition of T** as well as the terms involving 
i S {1,3,5}, tend to 0. Thus, we obtain 

7— >o ' 

This concludes the proof. □ 



Now we return to problem (P*) with q € _H'o(div) := {v S L (fi) : diviT € 
L^{il), V - n = on d^l} where n is the outward unit normal to d^l. We equip ffo(div) 
with the norm = \W\\i,2 + \\divv\\2- Then, standard arguments prove that 

(P* I admits a solution in Ho{d\v). 

Since the divergence operator has a nontrivial kernel, the solution of (P* ) need 
not be unique. The following problem, on the other hand, has a unique solution: 

7 9 1 

min -||div(f-5||^ + — [||min(div(f-.g + e + 2c,0)||^ 

q G Ho{div) ^ 46 

|^C^)|oo ^ a.c. in SI 

+ II max(divg — .9 + 0)||2 + || max(divg — ^ — e — 2c, 0)||2 
-||max(div<r-5-e,0)||2] +^||PdivqlL, (3-6) 

Above, Pdiv denotes the orthogonal projection in L^(il) onto i/o(divO) ;= {v € 
Ho{div) : divv = a.e. in 

Proposition 3.2. The solution of (|3.6p is unique. 

Proof. Since the set of feasible g in p.6p is convex and the term || min(div(f— g+6+ 
2c,0)||i + ||max(divg-5 + e,0)||i+||max(divg-g-6-2c,0)||i-||max(divg-5-e,0)||i 
is convex but not strictly convex, the main target now is to show that Q{q) 
^||div(f— 5II2 + f ||Pdiv'?1lL2 is strictly convex. Calculating the second derivative of Q, 
we get 

Q"{q,q) = -f\\dWq]\l + A||Pdivg1lL > ^Mw^) 
12 



for a constant k > independent of tf G -ffo(div). From this we infer the assertion. □ 
Remark 3.3. In our numerics we frequently set A = without running into 

difficulties due to non-uniqueness. This suggests that the box constraint \q{x)\oc < /? 

often yields uniqueness. 

For solving (13. 6p and in view of [21] (see also |22]) we consider the approximating 

unconstrained minimization problem 



7 



g-GHi(n)2 



i[||min(div(f-g 



11 max(div(f 
1 

2^' 



9 - 
a 



+ 2c,0)\\l 
2c, 0)||2 - II max(div(f- 



|max(div(f-5 + e,0)||2 
A 



e,0)| 



iPdivgl 



2 

L2 



max(0, (q- m\h + f II min(0, {q + m\h, 



where a > 0. Note that (-P^^ I and (|3.6p differ in that we add three quadratic terms 



weighted by a > in (_r* I. Furthermore, we keep here the same notations for 7 



and A although these quantities equal the corresponding ones in p.6p . but in (IP* 



they are multiplied by 2e, respectively. Straightforward arguments yield that the 
minimization problem (P* ^ \ tends to (P^ ) as a -t- 00. The unique solution q of 
\P^ ci I satisfies the first-order optimality condition 



Q{q) := —V [7(div(f — g) + min(div(f — g + e + 2c, 0) + max(div(f — 5 + e, 0) 
+ max(div(f — 5 — e — 2c, 0) — max(div(f — 5 — e, 0)] + APdiv<f 

--A^+ max(0, a[(f - (3)) + min(0, a(^+ /?)) = 0. (3.7) 
a 

It is well-known that the max- and min-operators are semismooth in our function 
space setting. This function space property of Q(-) was the sole reason for adding 
(2a)~^ II V(7||l2 in (P^.^l- In a purely finite dimensional approach one may discard 
this term. As a consequence, for the efficient numerical solution of (|3.7p . we utilize 
the semismooth Newton technique of ^U] . 

In our numerics we use the following generalized derivatives of the max- and the 
min-operations above: 



D max((^, 0)(x) 



1, 
0, 



if ip[x) > 0, 
otherwise. 



and D min((/3, 0)(x) 



if ip{x) < 0, 
otherwise. 



Then the generalized Newton step 6qk & Hj(ri) at the current iterate qt £ Hg(fi) is 
the unique solution to 



with the sets 



1 



XAii - XaI ) div + APdiv A 



a 



Sq = -Q{qk), 



A'i ^{divqk - g < ~ 2c}, 
A3 ={divqk-g > e + 2c}, 
Al ^{q-k > /?}, 



Al ={divgfe - .g > -e}, 
A\ ={divgfe - 5 > e}. 
At ={g-fc < -/?}. 

^g*. Using the results of 20 it can be 



The next iterate is defined as g*+^ 
shown that the above iteration converges at a superlinear rate to the solution g of 
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( P* ^ I provided that the initial point (f is sufficiently close to q. We mention that the 



latter locality requirement of the initial point seemed not restrictive in our numerical 
practice; see below. 

Upon discretization, keeping the same notations as above, we have g G M" , q* € 
M''" and, for A C {1, • • • ,m}, XA G K^x™ denotes a diagonal matrix with the 
diagonal entries ti = 1 ii i ^ A and ti — else for i = 1, ■ • • , m. Also, we have 

Ai ^{i : (divgfc), - < -e - 2c}, A2 ={i : (divfffe), - g^ > -e}, 

A3 ^{i : (divgfe), -g,> e + 2c}, ={i : (div^), - g, > e}, 

={z : {qk)^ > /3}, 4 : (<?!.). < -/?}■ 

Further, / S ]R''"X''" denotes the identity matrix, div = -V^ £ K«xd«^ ^^^^j 
" A 



A = 



V 











A 



G 



sdn X dn 



A e 



T,dn X n 



with i = 1, 



In our numerics, we discretize the Laplace operator by the standard five-point stencil 
with homogenous Dirichlet boundary conditions and use forward differences for the 
gradient and corresponding discrete adjoint scheme for the divergence operator. 

Notice that the system matrix of the semismooth Newton step is symmetric and 
positive definite. Consequently, we use the preconditioned conjugate gradient (PCG) 
method to solve the associated linear system. Upon convergence of the discrete New- 
ton iteration, we obtain the solution u to the discrete counterpart of (fPt based on 
the discrete version of 



4. Applications. In the following we discuss several important applications of 
optimization problems with shapes as the unknown and apply the above relaxation 
results. Moreover, associated numerical results allow us to study the behavior of our 
semismooth Newton algorithm. 

4.1. The Rudin-Osher-Fatemi model for binary image denoising. An 

important model for restoring images from data corrupted by Gaussian noise is the 
Rudin-Osher-Fatemi (ROF) problem [55] 



mm — (u 

.Gsy(n) 2 7n 



ff dx + P J{u) 



(4.1) 



where /3 > is a suitably chosen regularization parameter. If one is interested in 
binary images (e.g., black and white images), then the minimization process is con- 
strained to M S BV{n; {0, 1}). Clearly, the resulting problem is non-convex. However, 
as we shall see below, by using the techniques above there are several ways of relaxing 
(|4.1[) for binary images to obtain a convex problem. 

Chan and Esedoglu [9j considered a version of the ROF-functional for binary 
images, where they replaced the quadratic fidelity term by the L^-functional \u — 
f\ dx. Clearly, if u and / are binary, i.e., they take values in {0, 1} only, then 
\u{x) — f{x)\ € {0,1} for almost every x and hence, the L^-functional is equivalent 
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to the L^-functional, since \u — f\ — |u — /p. Using a proof technique similar to the 
one above, they could show that the relaxation of the L^-functional to BV{il.; [0, 1]) is 
equivalent in the sense that indicator functions of almost all level sets of minimizers 
are solutions of the problem in BV{il; {0, 1}). A disadvantage of this relaxation is the 
additional non-smooth term, which increases the computational effort in minimizing 
the relaxed functional. 

Instead, we consider here a linear relaxation of the first term, using the equivalence 
result of Lemma For this purpose we start with u G BV{ft; {0, 1}) and obtain 

Thus, the binary valued problem 

min - I [u- ff dx + B Jiu) (4.2) 

is equivalent to p.ip with g = ^ — f . Consequently, Theorem 12.21 applies, proving 
that a minimizer of (|2.2p with the above choice of g and for / G L^(f2) almost surely 
yields a solution of (14. 2p upon thresholding. 

Note that in contrast to the analysis in [5] we do not need to confine the data / 
to be binary, but rather consider arbitrary L^-functions as data. In order to illustrate 
existence and uniqueness in the model, below we consider two special cases in G 
(-1,1)- 

Example 4.1. Let f3 < \ and / be any function satisfying f{x) — 1 for x <Q, 
f{x) <\forx>Q. Then 



min I []-- f ] u dx + (3 J{u) (4.3) 



«ew(n;[o,i])7jj V2 
have a unique solution given (in an almost everywhere sense) by 



u{x) ~ 

This can be seen as follows. We have 



1, if X < 0, 
0, else. 



\-f^ udx + l3J{u)^ j ^(^--f^ dx + /3 = /3-i. 

For arbitrary u G BV{^1] {0, 1}), we have 

1 \ /"Vl \ /"Vl \ 

— — f ] u dx — o^/ dx + 77^/ '^2; 



2 / \ 2 / ' Jq y2 ^ 

<'o >o 

Thus, a smaller objective value than for u can only be achieved for functions u G 
BV{VL\ {0, 1}) with J{u) < J{u) = 1. The latter is possible only for u = Q or u = 1. 
For u = Q, we have 



^ Q - udx + P J{u) = > /3 - i. 
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and for u = 1 we find 



1 \ /"Vl .\ . 1 „ 1 



2 f\udx + pj{u)= \--f\dx>~->P^. 



Hence, u is the unique solution of 

min / I f) u dx + B J(u). 

ueBV(n;{0M}Jn V2 / 

Due to our equivalence and uniqueness results u is also the unique minimizer of the 
relaxed problem (|4.3p and of (|4.2p . 

Example 4.2. Lef < e < 1, /3 < and f be defined by 

1, if X < — e, 
/(x) = <( i, if - £ < a; < £, 
0, if X > e. 



Then, each function u satisfying 

u{x) 



1, if X < — £, 
0, if X > £, 



is a solution of (14. 3p . 

/n /aci, /or eac/i u defined as above, we have 

l-f) udx + l3J{u)^ [ '(l-f) dx + l3^l3-hl-e). 



Due to the above results, the relaxed problem (j4.3p has always a solution in BV{fl; {0, 1}), 
so that it suffices to consider functions in this class. For arbitrary u £ BV{^1] {0, 1}), 
we have 

\~ u dx^ J ^ [\- dx + J Q " 

<o >o 
>/_;Q-/) dx^^\il-e). 

Again, a smaller objective value than for u can only be achieved for functions u G 
BV{VL; {0, 1}) with J{u) < J{u) — 1, i.e., u = Q or u = 1. For u = 0, we have 

^ - /) udx + /3 J{u) = > /3 - 

and for u = 1 we obtain 

\-f^ udx + 13 J{u) = > /3 - iy^. 
Thus, u is a solution of (j4.3p . Furthermore, we readily conclude that each function 
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is a solution of (j4.2p if u ^ (—£,£). Therefore, in this case there is no unique recon- 
struction of f . 

Now we study the behavior of our algorithm for restoring binary images corrupted 

for 



by Gaussian noise. In Figure |4~T] we show the resuhs obtained by solving (P*^a 
restoring degraded images with 30% and 50% Gaussian noise, respectively, i'br all 
tests related to this example we fix c = 100, e = 10~^, 7 = 0.1, and a = 10'^. We stop 
the semismooth Newton iteration as soon as the initial residual is reduced by a factor 
of 10~®, or the difference between two successive residuals was smaller than 10~*. For 
solving the linear system occurring in each Newton step we utilize the PCG-method. 
The preconditioner consists of the Laplacian and the terms involving the indicator 
functions of A'l and A\. The stopping tolerance for the PCG-method in iteration 
/c + 1 is given by 

, , in-3 If resfe^^^ resfc 1 

tolfc+i = 10 • mm <^ , ) , 

I^Vresoy resoj 

where res^ denotes the residual of the linear system (13.71) at qt. Thus, we use an 
inexact semismooth Newton iteration. 

For the reconstruction we explore different values of P, which controls the trade- 
off between a good fit of / and a smoothness requirement due to the total variation 
regularization. From the results shown in Figure 14.11 we find that small P preserves 
details, but at the same time some noticeable noise remains; otherwise, if P becomes 
large, details are over-regularized. As we fix sufficiently small e and large enough c 
in our tests, we obtain the expected binary reconstruction u directly. This confirms 
our earlier observation in connection with p.3p . In all other cases, we simply set 
t ~ 0.5 and obtain the binary result as proposed in Theorem 12.21 Concerning 
the algorithmic behavior, from the residual plots we observe that despite the inexact 
linear system solves our algorithm only requires a rather small numbers of iterations 
until successful termination. This fast convergence can be attributed to the use of a 
semismooth Newton solver. Moreover, the iteration numbers are found to be stable 
with respect to different noise levels. 

4.2. Mumford-Shah-based binary image segmentation. In [TI], an active 
contour model based on Mumford-Shah segmentation techniques was proposed. It 
distinguishes objects from the background by solving 

min / [u(ci - ff + (1 - «)(C2 - dx + P J{u), (4.4) 

where u e BV{il; {0, 1}) is the indicator function of the set $7°''^ , which corresponds 
to the detected objects, and ft \ refers to the background set. Thus, for such a u 
the model (|4.4p yields a piecewise constant approximation /*^'^ of / with /*^'^ = ci 
on and Z*^'^ = C2 on 51 \ . Here, ci and C2 are the average values of / on the 
set and Q. \ , respectively. 

Obviously the key step to solving (|4.4p is to obtain the optimal indicator function 
u. We note that for fixed ci and C2 the minimization problem (14. 4p with respect to 



u is equivalent to (|2.ip with g = {ci — fY — (c2 — fY- Hence, we can apply our 
framework to solve this problem. 

In Figure 14. 2[ we display the resulting solutions u when segmenting the specified 
test images with different choices for p. In all tests, the choices of all other parameters 
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(d) (e) (f) (g) 

Fig. 4.1. (a) Original image, (b) noisy image with 30% Gaussian noise, (c) noisy image 
with 50% Gaussian noise. The results shown in the second row are from 30% noise; the 
ones shown in the third row are from 50% noise, (d) Restored images with /3 = 10^"', (e) 
the corresponding residual plots, (f) restored images with /3 = 10~^, (g) the corresponding 
residual plots. 



and the stopping rules are as in the previous section. InitiaUy, we set vP{x) = 1 if 
f{x) > 0.5 and u°{x) = otherwise. This gives n"''' = {x G : f{x) > 0.5}. Based 
on the initial u", we then calculate cj and 0° by averaging the data as discussed above. 
Algorithmically, whenever we obtain a new u'' by solving (|2.ip , then we update and 
C2 based on u'' and go to the next updating step for u until — < 10~^|fi|. 

Since the total variation of an indicator function on a set is related to the perimeter of 
the pertinent set [H], the resulting segmentation u strongly depends on the selection 
of /?. If /? is small, then small-scale objects will also be detected. On the other hand, 
if /3 is large, then only large-scale objects or objects formed by grouping smaller scale 
features are usually detected. This behavior can be seen from the results depicted in 
Figure E^l 

Although the original problem (12.11) is nonconvex, its relaxation (|2.2I) is convex. 
The penalized optimization problem ((P)) . which is solved by our algorithm, is even 
strictly convex. Hence, it has a unique global minimizer. However, the model (|4.4I) is 
non-convex with respect to (m, ci, C2), even after relaxation. But numerically in Figure 
14.31 we demonstrate that our algorithm works reliably and converges to the same 
minimizer, regardless of the initial choice w*^. We also mention that generalizations of 
the model (I4.4p as proposed in [3D] may be treated with our approach, too. 
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(a) (b) (c) 

Fig. 4.2. The results u for Mumford- Shah-based image segmentation with different choices 
of p. (a) The test images f for segmentation, (b) the results u with P — 8 x 10~'^, (c) the 
results u with /3 = 8 x 10~^. 



4.2.1. Multi-phase piecewise constant Mumford-Shah model for multi- 
labeling. Instead of only separating objects from a background, the multi-phase 
piecewise constant Mumford-Shah image segmentation problem distinguishes between 
more than two sets Q.i with associated values i.e., it formulates multi-labeling. 
In this context, we want to represent the input image / b^ a piecewise constant 
function /^"^ : — > {ci, • • • , c^v} with the partitioning VL = [J^^i and the property 
that f^'^{x) = Ci if x G ili for i = I,-- - ,N. Introducing an indicator function 
u, e BV{n; {0, 1}) of the set fl, for each i e {1, • • • , iV}, we have fP" = Y^fLi c^"i■ 
Extending the two-phase piecewise constant Mumford-Shah model (|4.4|) to the multi- 
phase case, we utilize the multi-labeling model 

min y / u, (c, - /)2 dx + /3 y J{u,), (4.5) 

uG i3y(O;{0, i})",ceR" ~l 

T.'iLi "i(^) = 1 for all i; e O 

where u = {ui, • • • , mat)^ and c — (ci, • • • , cn)^ ■ In fact, (|4.5p is a continuous variant 
of the Potts model [27j with total variation regularization as perimeter penalization. 

Considering the convex relaxation of (j4.5p in a rather straightforward manner 
gives 



N ^ N 



mm > / (c, -f)^dx + f3Y] ^M- (4-6) 

US BV(n;[0, 1])",cGR" ~l J ^ ~~{ 
= 1 for all a; G n 

We emphasize that this relaxation is known to be not necessarily exact; see |33] for 
more details in this vein. As a consequence, it is indeed possible that the minimum 
of (|4.6p is strictly less than the minimum of (j4.5p . and, thus, in general one cannot 
expect to obtain a minimizer of (|4.5I) from a minimizer of (j4.6p based on Theorem 
2.21 Further we mention that recently various research efforts have been devoted to 
studying and solving the multi- labeling problem; see [211 HH IH [H] ■ 
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► 



(b) 



(c) 



► 

n 

► 

n 

(d) 



Fig. 4.3. The results u for Mumford-Shah-based image segmentation with different initial 
values u° and (3 = 10~^. (a) The test image f for segmentation, (b) the initial m°, (c) , 
(d) the results u. 



In order to provide an exact convex relaxation context for the multi-phase case, 
we assume that the number of phases is equal to or less than 2*^. Referring to the 
work in [32' , such a situation can be represented by M indicator functions to represent 
the partitioning. As a consequence, the model (|4.4p can be generalized to 



mm 

aGBy(n;{o,i})*^,cei 



. M 
E / dx + /3 Ej(u.), (4.7) 



fte{o,i}^* 

where u = (mi, • • • , um)^ with Ui e BV{Vi\ {0, 1}) for each i = 1, • • • , M, and 

1=1 



y, 



if b, - 1. 



It is straightforward to prove that (|4.7I) has an exact convex relaxation (in the sense 
of Theorem [ 



mm 

u^BV{Q.-[0,l\Y' ,ce9?^ 



E / Z~^{u){c^^ -ffdx + pY. JM, (4.8) 

and a solution of (j4.7p can be obtained from a solution of (|4.8p in the same way as in 
Theorem 12.21 We solve (|4.8p iteratively as follows; For fc > and for i = 1, 2, . . . , M, 
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each is computed by fixing uj = Uj for all 1 < j < i and Uj — for i < 

j < M. With such a setting, the minimization problem resulting from (I4.8|) fits into 
our framework (j2.ip . Hence, we can employ our proposed solution algorithm. In 
Figure 1131 we show an example for labelling an image by 2 and 3 indicator functions, 
respectively. 

Although the objective function in (14.71) is convex for each Ui with all the other 
Uj {j ^ i) fixed, the associated overall minimization problem is of course not convex. 
Thus, the algorithm is not guaranteed to converge to a global minimizer. Moreover, 
the obtained results may depend on the initial choice of u. In order to show this 
influence of the initialization we depict several different initial choices together with 
the obtained solutions in Figure l475l 

4.3. Further applications. In this section, we provide two more applications 
fitting our exact relaxation framework, and show their relation to the full model (jl.ip . 

4.3.1. Optimal design of composite membranes. A well-known problem 
in the design of composite membranes (cf. [T^l [13]) is the minimization of the first 
eigenvalue A° of the elliptic problem 



— Av + auv — X°v 



in a domain D, subject to homogeneous Dirichlet boundary conditions on 951 and a 
volume constraint J^u dx = V, where v is the eigenfunction. The minimization is 
carried out with respect to u being the indicator function of some subset of 51, which is 
termed an optimal configuration for the data {ft, a, V). If we use additional standard 
perimeter penalization and the Rayleigh variational principle for the first eigenvalue, 
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Fig. 4.5. The results f^'^ from solving (|4.7p with different initial values u. (a) the initial 
u\, (h) the initial U2, (c) the results f^". 



then the associated optimization problem becomes 

7° !n {\^^? + auv'^ 



mm 

{u,v)eBV(n:{o,i}}xHl,{n) 



In 



/3 J{u) 



(4.9) 



s.t. 



dx < V, 



for fixed 7° > 0. Here, H^i^) denotes the Sobolev space with distributional deriva- 
tives in L^{U,Y and vanishing trace on d^. 

The problem fits into the framework of (|f .f I) with the choice V = Hq{VI), the 
functional 



F{v) 



7° /o (IV^P) 
2 J^v^Ax 



and the operator 



2 /o «^ da 



Note that due to the compact embedding of Sobolev spaces, G is a strongly continuous 
operator from V to L^(r2). Hence, we conclude that a minimizer of the composite 
membrane problem can be computed by solving the relaxed problem 



mm 

(«,w)6By(n;[o,i])x_Hi(n) 

s.t. / u dx <V, 
Jn 



7° In + auv'^) dx 



/3 J{u) 



and subsequently thresholding to 

u\x) 



1, if u > i, 
0, else. 
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4.3.2. Minimal compliance topology optimization. A particularly impor- 
tant problem in structural topology optimization is the minimal compliance problem 
(cf. |H15j), which can be written as 



min l(v) 

{u,v)eBV(n;{o.i})xH}y{nM'^) 

s.t. au{v,v) ^ l{v), for all u e 
u dx <V, 



(4.10) 



where u denotes the (material) density, v is the elastic displacement, and i?^(r2;K'') 
is the subspace of functions v € H^{U; R"*) with vanishing trace on Fd C dfl with dfl 
the boundary of the bounded domain 11. In addition we have 



l{v) := f V dx + t V ds, 
Jn J Ft 



au{v,v) := / u {E{v) : C : E{i)) dx, 
Jn 

where / are body forces, t is the boundary traction on the traction part Ft C d^l, C 
denotes a given elasticity tensor, and E{v) := ^{Vv + Vv^). The main idea of the 
formulation is to maximize the material stiffness at a given volume. 

Our algorithm can be applied in the minimal compliance context upon approxi- 
mating (|4.10|) by 



min l{v) + II 

{u,v)GBV{n-{0,l})xHly{n-R'') 



l{v) - / u{E{v) : C : E{v)) dx 
n 



(4.11) 



s.t. / u dx <V, 
Jn 



where /i > represents a large penalty parameter. Since the problem (|4.1ip is (still) 
ill-posed, approximate regularized problems are solved in practice in order to compute 
a stable topological design. In this direction, besides material interpolation schemes 
with penalization (cf. [S]), filter methods and perimeter penalization are of particu- 
lar importance (cf. [5l[6]). If if is a suitable nonnegative and smooth filter function 
approximating the Dirac-delta distribution, and e G (0, 1) is a small parameter to pre- 
vent the possible singularity of the equilibrium problem, then the regularized problem 
takes the form 



mm n 

(tt,?))GBV(i2:{0,l})xffi,(n;R<*) 



l{v) - I {e + {l-e){Kir u)){E{v) : C : E{v)) dx 
n 



+ I{v) + I3 J{u) 

s.t. / u dx <V, 
Jn 

where denotes convolution. 

We are now able to define the functional F : H}j{D.; W^) 

F{v) := (1 + ii)l{v) -ne I E{v) : C : E{v) dx, 

Jn 
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as 



(4.12) 



and the operator G : H}j{n;R'^) 



{il) as 



G{v)u : 



- e) [ (Ki^u) {E{v) : C : E{v)) dx. 



It is easy to see that F and G satisfy the above conditions, and hence, minimizers of 
the minimal compliance problem can be obtained as level sets of minimizers of the 
relaxed problem 



5. Conclusion. The exact relaxation framework considered in this paper allows 
to relax a class of minimization problems for binary valued unknowns in the space of 
functions of bounded variation to a convex, constrained minimization problem for a 
continuously valued unknown function. The solution of the original problem is almost 
surely gained back by computing level sets of the solution to the relaxed problem. This 
procedure is stable under certain data perturbations and allows to account for volume 
constraints, which are important, for instance, in minimal compliance problems in 
topology optimization. 

As it has been already observed for classical total variation regularization with 
L-^- or L^-data fidelity terms ([16l [20l [22]) inexact variants of semismooth Newton 
methods allowing for rather inexact solutions of the associated linear systems are 
efficient solvers also for problems of the underlying type. This is demonstrated in this 
work for examples of binary image denoising and multi-phase image segmentation. 

The exact relaxation framework (in analysis and numerics) allows to treat various 
application classes. Besides image processing (denoising, segmentation, labelling), the 
design of composite membranes and the computation of topological structures with 
minimal compliance fall into this category. 



[1] R. Acar and C. R. Vogel. Analysis of total variation penalty methods. Inverse Problems, 
10:1217-1229, 1994. 

[2] G. Allaire. Shape Optimization by the Homogenization Method. Springer, New York, 2002. 
[3] L. Ambrosio. Metric space valued functions of bounded variation. Ann. Scuola Norm. Sup. 

Pisa CI. Sci. (4), 17(3):439-478, 1990. 
[4] E. Bae, J. Yuan, and X.-C. Tai. Global minimization for continuous multiphase partitioning 

problems using a dual approach. International Journal of Computer Vision, 92(1):112-129, 

2011. 

[5] M. P. Bends0e and O. Sigmund. Topology Optimization. Springer, Berlin, 2002. 

[6] B. Bourdin. Filters in topology optimization. International Journal for Numerical Methods in 

Engineering, 50:2143-2158, 2001. 
[7] J. V. Burke. An exact penalization viewpoint of constrained optimization. SIAM Journal on 

Control and Optimization, 29:968-998, 1991. 
[8] A. ChamboUe and P-L. Lions. Image recovery via total variation minimization and related 

problems. Numerische Mathematik, 76:167-188, 1997. 
[9] T. F. Chan and S. Esedoglu. Aspects of total variation regularized function approximation. 

SIAM Journal on Applied Mathematics, 65:1817-1837, 2005. 
[10] T. F. Chan, S. Esedoglu, and M. Nikolova. Algorithms for finding global minimizers of image 

segmentation and denoising models. SIAM Journal on Applied Mathematics, 66(5):1632- 




(4.13) 




REFERENCES 



1648, 2006. 



24 



[11] T. F. Chan and L. Vese. Active contours without edges. IEEE Transactions on Image Pro- 
cessing, 10(2):266-277, 2001. 
[12] S. Chanillo, D. Grieser, M. Imai, K. Kurata, and I. Onishi. Symmetry breaJcing and other 

phenomena in the optimization of eigenvalues for composite membranes. Communications 
in Mathematical Physic.% 214:315-337, 2000. 

[13] S. J. Cox and J. R. McLauglilin. Extremal eigenvalue problems for composite membranes, i 
and ii. Applied Mathematics and Optimization, 1990. 

[14] D. Cremcrs, T. Pock, K. Kolcv, and A. ChamboUc. Convex relaxation techniques for segmen- 
tation, stereo and multiview reconstruction. In Markov Random Fields for Vision and 
Image Processing. MIT Press, 2011. 

[15] M.C. Delfour and J. -P. Zolesio. Shapes and geometries. Analysis, differential calculus, and 
optimization. SIAM, Philadelphia, 2001. 

[16] Y. Dong, M. Hintermiiller, and M. Neri. An efficient primal-dual method for L^TV image 
restoration. SIAM J. Imaging Sci., 2(4):1168-1189, 2009. 

[17] I. Ekeland and R. Temam. Convex Analysis and Variational Problems. Classics in Applied 
Mathematics 28, SIAM, Philadelphia, 1999. 

[18] W. H. Fleming and R. Rishel. An integral formula for the total gradient variation. Archiv der 
Mathematik, ll(l):218-222, 1960. 

[19] E. Giusti. Minimal Surfaces and Functions of Bounded Variation. Birkhauser, Boston, 1984. 

[20] M. Hintermiiller, K. Ito, and K. Kunisch. The primal-dual active set strategy as a semismooth 
Newton method. SIAM J. Optim., 13:865-888, 2002. 

[21] M. Hintermiiller and K. Kunisch. Total bounded variation regulaxization as bilaterally con- 
strained optimization problem. SIAM Journal on Applied Mathematics, 64:1311-1333, 
2004. 

[22] M. Hintermiiller and G. Stadler. An infeasible primal-dual algorithm for total bounded 
variation-based inf-eonvolution-type image restoration. SIAM J. Sci. Comput., 28{l):l-23, 
2006. 

[23] J.-B. Hiriart-Urruty and C. Lemareehal. Convex Analysis and Minimization Algorithms. 
Sijringer, 1993. 

[24] J. Lellmaim and C. Schnorr. Continuous nmlticlass labeling approaches and algorithms. SIAM 
Journal on Imaging Sciences, 4(4): 1049 1096, 2011. 

[25] D. Mumford and J. Shah. Approximation by piecewise smooth functions and associated varia- 
tional problems. Communications on Pure and Applied Mathematics, 42:577-685, 1989. 

[26] T. Pock, D. Cremers, H. Bischof, and A. ChamboUe. Global solutions of variational models 
with convex regularization. SIAM Journal on Imaging Sciences, 3(4):1122-1145, 2010. 

[27] R. B. Potts. Some generalized order-disorder transformations. In Proceeding of the Cambridge 
Philosophical Society, volume 48, pages 106-109, 1952. 

[28] L. I. Rudin, S. Osher, and E. Fatemi. Nonlinear total variation based noise removal algorithms. 
Physica D, 60:259-268, 1992. 

[29] W. Rudin. Functional Analysis. International Series in Pure and Applied Mathematics. 
McGraw-Hill, 2nd edition, 1991. 

[30] A. Sawatzky, D. Tenbrinck, X. Jiang, and M. Burger. A variational framework for region-based 
segmentation incorporating physical noise models. CAM Report 11-81, April 2012. 

[31] E. Strekalovskiy, A. ChamboUe, and D. Cremers. A convex representation for the vectorial 
Mumford-Shah functional. In IEEE Conference on Computer Vision and Pattern Recog- 
nition (CVPR), Providence, Rhode Island, June 2012. 

[32] L. A. Vese and T. F. Chan. A multiphase level set framework for image segmentation using 
the Mumford and Shah model. International Journal of Computer Vision, 50(3):271-293, 
2002. 

[33] C. Zach, D. Gallup, J.-M. Frahm, and M. Niethammer. Fast global labeling for real-time stereo 
using multiple plane sweeps. In Vision, Modeling and Visualization Workshop (VMV), 
2008. 



25 



